A Model and Solution to the DNA ipping string problem
نویسندگان
چکیده
We consider the case where a pool of DNA molecules clones both, ipped and not-ipped, have been cut by restriction enzymes. Ideally, each clone is cut in the same positions, although in practice due to errors, this does not always happen. The computational problem is to determine where the cuts have occurred. This is a key problem in determining the structure of the original DNA molecule. A single molecule is represented by a string of 1's and 0's, with cuts represented by 1 0 s. A set of molecules clones (with errors) is observed, but the orientation/parity of each molecule is unknown. Clear is that the location of the observed cuts of one molecule are dependent on the parity: ipping the molecule would result in the cuts location, as observed, being \\ipped". We propose a Bayesian approach to generate a posterior distribution on the cuts and parity, given the data. We rst present an approximate algorithm where we attempt to divide the problem into subproblems, but it is not guaranteed to solve the problem. Then, we propose another approximate method based on a statistical framework and a mean eld annealing algorithm. It computes the maximum posterior marginal (MPM estimator) and maximum aposteriori estimate (MAP estimator). We also provide evidence that the exact solution of the problem is intractable.
منابع مشابه
THE PC-TAU METHOD ON THE SOLUTION OF A MODEL OF AN ISOLATED COSMIC STRING
In this paper, we adapt the operational Tau Method for personal computers and apply it to a system of two nonlinear second order ordinary differential equations which are related to general relativity. The interesting behaviour this problem exhibits in its numerical treatment is discussed. In this problem, we try to use the Tau perturbation term to locate the correct solution of this nonlin...
متن کاملModeling of Partial Digest Problem as a Network flows problem
Restriction Site Mapping is one of the interesting tasks in Computational Biology. A DNA strand can be thought of as a string on the letters A, T, C, and G. When a particular restriction enzyme is added to a DNA solution, the DNA is cut at particular restriction sites. The goal of the restriction site mapping is to determine the location of every site for a given enzyme. In partial digest metho...
متن کاملانتخاب کوچکترین ابر رشته در DNA با استفاده از الگوریتم ازدحام ذرّات
A DNA string can be supposed a very long string on alphabet with 4 letters. Numerous scientists attempt in decoding of this string. since this string is very long , a shorter section of it that have overlapping on each other will be decoded .There is no information for the right position of these sections on main DNA string. It seems that the shortest string (substring of the main DNA string) i...
متن کاملOptimal Cell Ipping to Minimize Channel Density in Vlsi Design and Pseudo-boolean Optimization 1
Cell ipping in VLSI design is an operation in which some of the cells are replaced with their \mirror images" with respect to a vertical axis, while keeping them in the same slot. After the placement of all the cells, one can apply cell ipping in order to further decrease the total area, approximating this objective by minimizing total wire length, channel width, etc. However, nding an optimal ...
متن کاملAsymptotic Approximations of the Solution for a Traveling String under Boundary Damping
Transversal vibrations of an axially moving string under boundary damping are investigated. Mathematically, it represents a homogenous linear partial differential equation subject to nonhomogeneous boundary conditions. The string is moving with a relatively (low) constant speed, which is considered to be positive. The string is kept fixed at the first end, while the other end is tied with the ...
متن کامل